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Abstract. We investigate the power of the most important lower bound 
technique in randomized communication complexity, which is based on 
an evaluation of the maximal size of approximately monochromatic rect- 
angles, minimized over all distributions on the inputs. While it is known 
that the 0-error version of this bound is polynomially tight for deter- 
ministic communication, nothing in this direction is known for constant 
error and randomized communication complexity. We first study a one- 
sided version of this bound and obtain that its value lies between the 
MA- and AM-complexities of the considered function. Hence the lower 
bound actually works for a (communication complexity) class between 
MAnco-MA and AMHco- AM. We also show that the MA-complexity 
of the disjointness problem is Q(y/n). Following this we consider the 
conjecture that the lower bound method is polynomially tight for ran- 
domized communication complexity. First we disprove a distributional 
version of this conjecture. Then we give a combinatorial characteriza- 
tion of the value of the lower bound method, in which the optimization 
over all distributions is absent. This characterization is done by what we 
call a uniform threshold cover. We also study relaxations of this notion, 
namely approximate majority covers and majority covers, and compare 
these three notions in power, exhibiting exponential separations. Each 
of these covers captures a lower bound method previously used for ran- 
domized communication complexity. 



1 Introduction 

Communication complexity has grown into a central area in theoret- 



ical computer science since the seminal article by Yao [Y79], finding 
more and more applications that range from from VLSI resource-tradeoffs 
(e.g. p?79fl ) to data-stream computations (e.g. [3S02]), see the excel- 



lent monography [KN97] for pre-1997 applications. While communication 



complexity has been helpful by inspiring upper bounds in other models, 
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its main importance lies in the lower bounds it provides. Often also varia- 
tions of the techniques first devised for communication complexity become 
important in other areas, e.g. the field of branching programs (see [ W00|] ). 



Lower bounds for deterministic communication complexity are usu- 
ally not very hard to prove, but they are often not strong enough in 
applications. Considering randomized communication complexity is fre- 
quently necessary. The lower bound on the monotone circuit depth of the 



matching function given in [KW92] is an example where randomized com- 
munication complexity is used to prove a lower bound for some resource 
in a deterministic model. Furthermore, since randomized algorithms are 
considered standard today and any problem for which we can describe an 
efficient randomized algorithm is considered tractable, lower bounds on 
the randomized communication complexity are necessary to show that a 
communication problem is hard. Also, communication complexity is an 
interesting scenario to study the power of randomization. 

Basically all lower bounds on randomized communication complexity 
with bounded error are derived by considering properties of rectangles in 
the communication matrix. These proofs are usually done in two steps. 
First, so-called distributional communication complexity is considered. 
The distributional deterministic communication complexity with error e 
under a distribution [i on the inputs is the minimal complexity of a de- 
terministic protocol computing a function while erring with probability 
at most e under [i. According to the Yao-principle the randomized com- 
plexity of a problem equals the maximum over all distributions of the 
distributional deterministic complexity. Hence this first step can always 
be done without loss of generality (or degradation of the bounds). 

After choosing an appropriate distribution on the inputs one is left to 
analyze the deterministic distributional communication complexity. The 
2 C message sequences used by a communication c protocol partition the 
communication matrix into 2° rectangles)^ labeled with the output of the 
protocol on that message sequence. Proving a lower bound on the number 
of rectangles needed in such a partition is then done by showing that all 
1— e-correct rectangles are small. This approach or variants of it have been 
used by Yao | Y8S ] , Babai et al. [ BFS86|1 , Razborov R92[| , and adapted to 



partial functions also by Raz [|R99|| , so that virtually all important lower 
bound proofs (except |KS92j| ) for randomized communication complexity 
follow the described pattern. 

More precisely, the lower bound method (as described by Yao | Y83fl ) 



goes as follows: First one fixes a distribution that puts roughly as much 



1 For a definition of communication matrices and rectangles see Definition ^. 
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weight on the 1-inputs as on the 0- inputs of a function /. One decides 
whether one proves a bound on the size of rectangles containing pre- 
dominantly 1-inputs or 0-inputs. Then one shows that all rectangles of 
the desired type with size larger than l/2 fc must contain an e-fraction of 
wrongly classified inputs. As the consequence the randomized communi- 
cation complexity of / is Q{k). 

The main question motivating this paper is whether this lower bound 
method is tight, i.e., whether we may always prove lower bounds at most 
polynomially smaller than the actual randomized communication com- 
plexity using this method. This question is stated as Open Problem 3.23 
in [KN97]. Since the method already yields lower bounds, answering the 
question in the affirmative demands showing an upper bound on the ran- 
domized communication complexity in terms of the maximal value ob- 
tained by the lower bound method. 

It is well known that this is possible in the case e = 0, i.e., the 
corresponding lower bound method for deterministic protocols based on 
the size of monochromatic rectangles always yields results being at most 
quadratically smaller that the deterministic communication complexity. 
This result can be proved in two steps: first the 0-error rectangle bound is 
characterized via nondeterministic communication complexity (see The- 
orem 2.16 in | KN97[ ). Then the deterministic communication complex- 
ity is upper bounded by the product of the nondeterministic and co- 
nondeterministic communication complexities | AUY83f| , Theorem 2.11 in 
[KN97]. In this paper we consider the analogous question in the situation 
when the error probability is larger than 0. 

Note that all proofs in this paper are provided in the appendix. 



2 Power of the rectangle bound 

First let us fix some notation and give a formal definition of the main 
lower bound method investigated in this paper. 

Definition 1. Let fi be a distribution on {0, l} n x {0, l} n and a < 1/2. 

Then p is a-balanced for f : {0, l} n x {0, l} n -> {0, 1}, if 

a<n{f-\l))^{f- l m<l-a. 

1/2-balanced distributions are called strictly balanced, 1 / 'A-balanced 
distributions are called balanced. 
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Definition 2. The communication matrix of a function f : {0, l} n x 
{0, l} n — > {0, 1} is a matrix Mf with rows and columns each correspond- 
ing to {0, l} n , and with Mf(x, y) = f(x, y). 

A rectangle is a product set in {0, 1}™ x {0, l} n . Rectangles are labeled, 
a v-rectangle being labeled with v G {0, 1}. v{R) gives the label of R. 

The size of a rectangle (or any other set) R with regard to some distri- 
bution fi on {0, 1}™ x {0, l} n is fJ,(R) = J2 x ,yeR ^{ x iV)- Let err(R, fi, v) = 
— v)\R) denote the error of a v-rectangle R. 

We consider both one-sided and two-sided versions of the rectangle 
bound. 

Definition 3. 

size(fi,e, f,v) = max{^(i?) : err(R, fi,v) < e}, 
where R runs over all rectangles in Mf. 

bound^(f) = max log (1 /size e, /, 1)), 

where fi runs over all balanced distribution on {0, l} n x {0, 1}™. 
Furthermore 

bound e (f) = m.ax.{bound^ 1 \f),bound^ (-<f)}. 

We use the conventions bound(f) = 6ound 1 / 4 (/) and bound^ (/) = 
boundfJ A {f). 

Let us first note a fundamental property of the rectangle bound, 
namely error reducibility. 

Lemma 1. Let e< 1/2 - 

Assume boundi 1] (f) = k. Then bound { )\f) < 0(lk). 
Assume bound e (f) = k. Then bound e i(f) < 0(lk). 

The lemma is proved in appendix C. Also note that the definition 
is almost invariant with respect to the "balancedness" of the underlying 
distribution, see again appendix C for a proof. 

Lemma 2. Assume max^ log (1 /size (/x, e, /, 1)) = k where fi runs over 
all a-balanced distribution on {0, l} n x {0, l} n for some constant a. Then 

k = 0{bound^\f)), 

given that e < a/4. 
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For definitions of the different communication complexity modes con- 
sidered in this paper see appendix B. Note that all randomized modes of 
communication complexity are defined to have a public coin there. 

It is well known that bound(f) yields a lower bound on the randomized 
communication complexity of /, see [KN97[. So we have the following 
lower bound method. 

Method 1 (e-error randomized communication complexity) 

1. Pick a balanced distribution on {0, 1}" x {0, l} n . 

2. Pickv € {0,1}. 

3. Show that all 1 — e-correct v-rectangles in Mf have size < 2~ b . 

4. Then R t (f) > (2(b). 

As an example we give the following fact due to Razborov |R92[| , which 
will be used several times in this paper. Let DISJ(x, y) = A™ =1 (-iXi V-iyi) 
be the set disjointness problem. 

Fact 1 For DISJ there is an balanced distribution \i on {0, l} n x {0, l} n , 
so that every 0-rectangle R in {0, l} n x{0, l} n either satisfies p(R) < 2" 1371 
or err(R, /i, 0) > e for some constants (3, e > 0. 

In other words (using Lemma \^), R(DISJ) > bound <yl \DISJ) = 

a(n). 

We now try to determine exactly for which class of problems the lower 
bound method works. We show that bound^\f) lies between the MA- 
and AM-complexities of /. 

Theorem 1. 1. For f : {0, l} n x{0, l} n -> {0, 1} and e G [1/2 MA V\ 1/2- 
MA e (f) > ft (^boundP(f)) . 



2. For all f : {0, l} n x {0, l} n -» {0, 1} and e < 1/2 - Q(l): 

AM e (f) < 0(bound^(f) +log(l/e)). 

With this theorem and Fact |l| we can conclude a new lower bound in 
communication complexity. 

Corollary 1. 

MA(DISJ) = ^(v 7 ^), 

while 

N(^DISJ) = O(logn). 
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It seems unlikely that, but remains unknown whether DISJ has effi- 
cient ^4M-protocols. Actually no separation between larger classes than 
MA ^ co — MA is known within the communication complexity version 
of the polynomial hierarchy (see [ BFS8Gfl , polynomial time is replaced by 



poly logarithmic communication in this definition). Note that it is still 
open whether the polynomial hierarchy in communication complexity is 
strict. Actually we will give a lower bound in Theorem [7| showing that 
some explicit function is not contained in some even larger subclass of the 
polynomial hierarchy in communication complexity than MAUco — MA, 
yet that function itself probably is not included in the hierarchy, as op- 
posed to DISJ. 

We can conclude the following relations between Arthur Merlin and 
randomized communication and the lower bound method. 

Corollary 2. 

R(f) > n{bound(f)), 



R{f) > f2(max{MA{f),MA(^f)}) > Q \Jbmmd(f)\ , 
bound(f) > n(max{AM(f), AM(-if)}). 

If we could show that any function with both small AM- and co—AM- 
complexity also has small randomized complexity we could show that 
lower bound method 1 is always polynomially tight. If, on the other hand, 
R(f) — g(bound(f)) for some superpolynomial g and some /, then there is 
a superpolynomial separation between max{j4M(/), AM(—>f)} and R(f). 

The first attempt to prove tightness of bound(f) coming to mind uses 
the Yao-principle and switches to (non-) deterministic distributional com- 
plexity with error in the hope to employ similar techniques as in previous 
combinatorial results ||AUY83||, where D(f) < 0(N(f) ■ N(->f)) is shown 



for all / : {0, l} n x {0, 1}™ -» {0, 1}. 

It is well known (see Theorem 3.20 in [ KN97|] ) that 

Fact 2 

R e {f) = maxl%(f). 

We observe that by the same proof 
Lemma 3. 



AM € (f) = maxiVf (/). 

(J, 
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Hence if we could relate the e-error distributional nondeterministic 
complexity N^(f) + Nf?(—if) to the e-error distributional deterministic 
complexity D^(f) for all distributions [i we would have shown that the 
rectangle bound is always polynomially tight. But the approach does not 
work as shown in the next result. 

Theorem 2. There is a function WHICH : {0, l} n x {0, l} n -> {0, 1}, 

a balanced distribution fi on {0, l} n x {0, l} n ; and a constant e > so 
that 

Ng{WHICH),Ng{-^WHICH) = O(logn), 
D£(WHICH) = Q{n). 

So this first attempt to prove that the rectangle bound is tight, fails. 
Proving R(f) < poly (AM (f) + AM(-i/)) requires an argument not con- 
sidering the distributional complexity for all distributions separately. Also 
note that the distributions maximizing Nj^(f), N^{—if), D£(f) are in gen- 
eral not the same. 

The proof of Theorem ^ establishes a lower bound on the number of 
rectangles needed to partition (with small error) the communication ma- 
trix into rectangles, while errorfree covers (with overlapping rectangles), 
and hence large errorfree 1- and 0-rectangles exist. 

3 The rectangle bound and bounded error uniform 
threshold covers 

In this section we start another approach to prove that the lower bound 
method is tight. Instead of considering Arthur Merlin complexity we char- 
acterize the lower bound method itself combinatorially. 

Definition 4. A uniform threshold cover with parameters s,t for a com- 
munication problem f : {0, 1}™ x {0, 1}™ — > {0, 1} is a set of rectangles in 
the communication matrix of f with labels from {0, 1} ; so that for each 
input x,y at least t of the adjacent rectangles bear the correct label f(x,y) 
and at most s of the adjacent rectangles bear the wrong label 1 — f(x,y). 

A one-sided uniform threshold cover is as above, but only 1-labeled 
rectangles are used, and inputs with f(x, y) = 1 lie in at least t rectangles, 
while inputs with f(x, y) = lie in at most s rectangles. 

Let f : {0, l} n x {0, l} n — * {0, 1} be a communication problem. Let 
P denote the minimal size of a one-sided uniform threshold cover with 
parameters s,t for f. Then UT^(f) = [logP] is called the one-sided 
uniform threshold complexity of f . 
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Let f : {0, l} n x {0, l} n — * {0, 1} be a communication problem. Then 

UT s j(f) = max{£/T s j (/), UT^ (~~>f)} is called the uniform threshold 
complexity of f , and equals (within ±1) the logarithm of the minimal 
size of a uniform threshold cover for f. 

We will say that a uniform threshold cover with parameters s, t has 
bounded error, if t > 2s. 

The main features of bounded error uniform threshold covers are first, 
that the acceptance threshold is the same for all inputs, and secondly, the 
bounded error. 

Remark 1. Given a bounded error uniform threshold cover for / of size 
2 k with the parameters s, 2s, we can form all possible I = log(l/e) tuples 
of 1-rectangles, and take the intersections of the rectangles in such tuples 
into a new cover, labeled as 1-rectangles. Then we proceed analogously 
with the 0-rectangles. Clearly each input is in (2s) 1 = (1/e) ■ s l correctly 
labeled rectangles, and in at most s l incorrectly labeled rectangles. Hence, 
there is a value s' = (l/e)s l with UT ts ^ s ,(f) < 0(k ■ log(l/e)). 

We now characterize the lower bound method in terms of bounded 
error uniform threshold covers. 

Theorem 3. 1. (a) bound (1 \f) < 0(UT^ s (f)). 
(b) bound(f)<0(UT St2s (f)). 

2. (a) UT^if) < 0(boundP*(f) ■ logn). 
(b) UT ntTl 2{f) < 0{bound(f) ■ logn). 

Note that UT^ s (f) < 0(bound(f)) is not always true, as we will 
show after Theorem |] in Remark ||. 

So we have a quite natural version of covers that captures the tech- 
nique used in most of the lower bounds for randomized communication 
complexity. Showing that R{f) < poly(UT nn 2(f)) for all / : {0, l} n x 
{0, l} n — > {0, 1} would immediately show tightness of bound(f). We have 
been unable to prove such a result so far. Nevertheless, Theorem || turns 
the problem of showing tightness of the lower bound method into a com- 
binatorial one, not involving a maximization over distributions. Alterna- 
tively we may reformulate the problem as follows. 

Corollary 3. Let UT[R] : R -» {0, 1} for a rectangle R C {0, 1}^ x 
{0, 1}^ be the following communication problem (for some value t de- 
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pending on R): 



I 1 if Ei=i x { Ayi>t and Eil7v/2+i x i A Vi < 
»/ E J = / i *i A yi < >/t and E,=7V/2+i ^Ay t >i 



A protocol for computing UT[R] works under the promise that R contains 
only defined inputs. A protocol computes UT, iff for each R (that contains 
only defined inputs) the players (knowing R) compute UT[R] correctly. 



Note that for the communication problem UT, the threshold t is usu- 
ally much smaller than N. 

So it is sufficient (and necessary) to give an efficient randomized pro- 
tocol for the promise problem UT to show tightness of bound(f). 

4 Comparing different notions of threshold covers 

We now consider variations of the notion of threshold covers. The most 
immediate is a majority cover. 

Definition 5. A majority cover for a function f is a set of labeled rect- 
angles so that for each input the majority of the adjacent rectangles bears 
the correct label. Ties are broken in favor of f(x,y) = 1. 

Let PP(f) denote the logarithm of the size of a smallest majority 
cover for a function f. 

The above notion of majority covers corresponds to majority nonde- 
terministic protocols, which accept an input, whenever there are more 
nondeterministic computations leading to acceptance than to rejection: 
each computation in a nondeterministic protocol corresponds to a rect- 
angle. Majority covers are also equivalent to randomized protocols with 
error "moderately" bounded away from 1/2 as shown in |HR90| ] . 

Fact 3 There is a majority cover of size 2 k for f : {0, l} n x {0, l} n — > 
{0,1}, iff there are e,c with c + log(l/e) = 0(k), as well as a randomized 
protocol with error 1/2 — e > and communication c computing f (in 
the protocol the players are allowed to use a private source of randomness 
only). 



undef else. 



Then 




R(UT) < poly{\ogN) 

, 1}™ -» {0, 1} : R{f ) < poly(bound(f) log n). 
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Note that a randomized protocol can be viewed as a probability dis- 
tribution on deterministic protocols, and that each deterministic protocol 
induces a partition of the communication matrix into rectangles labeled 
with the function value. Then the union of all these rectangles is a uniform 
threshold cover for /, though not one with bounded error. The number of 
rectangles used is this cover is upper bounded by the number of message 
sequences used in the randomized protocol (here we use the fact that the 
randomized protocol can access private random sources only). 

Corollary 4. PP(f) < 0(k) ^ 3s : UT s>s+1 (f) < O(k). 

Hence, if we drop the bounded error feature from uniform threshold 
covers, we can as well drop the uniformity feature. 



It is shown in [K101] that majority covers have a strong connection to a 



lower bound method in communication complexity based on discrepancy. 

Definition 6. Let fi be any distribution on {0, l} n x {0, 1}™ and f be any 
function f : {0, l} n x {0, l} n -» {0, 1}. Then let 

disc^f) = max| M (P n f-\0)) - n(R n f^U))!, 

R 

where R runs over all rectangles in Mf. Denote disc(f) = min^ discu(f)- 



Fact 4 For all f : {0, l} n x {0, l} n -> {0, 1}: 

log(l/diac(/)) < PP(f) < 0(logl/dwc(/) + logn). 

Note that discrepancy l/2 k under some distribution essentially means 
that all rectangles with size at least l/2 fc / 2 have error at least l/2-l/2 fc / 2 . 
We can prove lower bounds for the PP-complexity in the following way. 

Method 2 (PP-complexity, discrepancy) 

1. Pick a distribution \i on {0, l} n x {0, l} n . 

2. Show that all rectangles R in M f have |/i(Pn/- 1 (0))-^(Pn/" 1 (l))| < 

3. Then PP(f) > Q(b). 

Consider the function MAJ : {0, l} n x {0, l} n -> {0, 1} with 

n 

MAJ(x, y) = 1 <=> ^(x, A yi ) > n/2. 
i=i 
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Obviously MAJ has a majority cover of size 0(n). It is easy to see that 
DISJ and its complement can both be reduced to MAJ . Hence we can 
easily separate majority covers from one-sided bounded error uniform 
threshold covers using Fact [l]: bound^(DISJ) = Q{n) which implies 
with Theorem | that UT^ S (DISJ) = Q{n). 

Corollary 5. PP(MAJ) = O(logn). 
UT^l(MAJ) = Q(n) all s. 

So a majority cover is in fact much stronger than even a one-sided 
uniform threshold cover with bounded error. Let us now consider a relax- 
ation of majority covers that has bounded error in some sense. Compared 
to bounded error uniform threshold covers we now drop the uniformity 
constraint on the threshold. 

Definition 7. An approximate majority cover is a majority cover in 
which for each input at least 3 /4 of the adjacent rectangles bear the correct 
label. Let APP(f) denote the logarithm of the size of a minimal approx- 
imate majority cover for f. 



Remark 2. Note that the parameters 1/4,3/4 can be improved to arbi- 
trary constant e, 1 — e by forming /c-tuples of rectangles and taking their 
intersections as the new approximate majority cover with k = log(l/e). 

The definition of approximate majority covers is similar to threshold 
computations on Turing machines in a class named BPP pat h as considered 
in [ HHT97 ]. We prefer our naming to BPP pat h, since the class has little 



similarity to BPP and is not defined in terms of paths here. It is shown 
in [HHT97] that BPP pa th contains MAUco — MA and is hence probably 



much more powerful than BPP. We immediately get a similar result for 
communication complexity using Theorems |l| and |||. 

Theorem 4. APP(f) < 0(UT^ s (f)) for all s. 

APP(f) < 0(UT^f)) for all s. 
APP(f) < mm{0(MA(f) 2 ),0(MA(^f) 2 )}. 



Remark 3. It is easy to see that PP(EQ) = (9(logn) for the equality 
function EQ(x,y) = 1 <^=^ x = y, since PP(f) > log D(f). Hence 
XJT^ S [EQ) = Q(\ogn). On the other hand b ound(EQ) = 0(1), since 
R(EQ) = 0(1) (see Example 3.13 in | KN97 | and note that random- 
ized protocols are defined to have a public coin here). Hence the relation 
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UTg2 S {f) — 0(bound(f) log n) from Theorem |3] cannot be improved to 
< 0(bound(f)), but possibly to < 0{bound{f) + logn). 

ylPP-complexity has an interesting connection to a lower bound method 
as follows. 

Theorem 5. // APP(f) = k then for all balanced distributions fi there 
is a rectangle of size 1/2°^ with error 1/4- 

If for all balanced distributions fi there is a rectangle of size l/2 k and 
error 1/4 then APP(f) < 0(k) + logn. 

Thus given that APP(f) is small, there is a large rectangle with 
small error for each distribution, sometimes a 1-rectangle, sometimes a 
O-rectangle. We are lead to the following lower bound method. 



Method 3 (APP-complexity) 

1. Pick a balanced distribution on {0, l} n x {0, l} n . 

2. Show that all 1 — e-correct rectangles in Mf have size < 2~ b . 

3. Then APP(f) > Q{b). 



Actually it has been shown by Yao in [ Y83| | that for some explicit function 



and some balanced distribution neither large 1 — e-correct 0-rectangles 
nor large 1 — e-correct 1-rectangles exist, hence he demonstrated that 
this function has linear APP complexity, which is a much stronger result 
than his conclusion that the function has linear randomized bounded error 
communication complexity. 

We give a separation result between the two types of covers, stating 
that approximate majority covers are actually much more powerful than 
one-sided bounded error uniform threshold covers and hence also than 
MA-protocols. 

Theorem 6. There is a function BOTH : {0, l} n x {0, l} n -> {0, 1} so 
that 

APP(BOTH) = O(logn), 

bound {l) (BOTH) > Q{n); bound {l) (^BOTH) > Q{n). 
Hence also 



UT^ s (BOTH) > fl(n); UT^BOTH) > f2(n), 
MA(BOTH) > f2(s/n); MA(^BOTH) > Q(^n). 
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So approximate majority covers are exponentially more powerful than 
one-sided bounded error uniform threshold covers for BOTH and its com- 
plement. In terms of the lower bound methods this means that for BOTH 
it is true that for every balanced distribution there is a rectangle of size 
l/poly(n) with constant error, but there exists a balanced distribution, 
where all 1-rectangles either have error 1/2 — o(l) or size l/2 fi ( n ), and 
there exists a balanced distribution, where all O-rectangles either have 
error 1/2 — o(l) or size 

To complete the picture we compare the power of APP and PP covers. 

Theorem 7. 

PP(MAJ) = O(logn), 

APP(MAJ) = Q{n). 

Note that the above result can be read as saying that for MA J for all 
balanced distributions there exists a rectangle with discrepancy l/poly(n) 
(having hence size \/poly[n) and error 1/2 — l/poly(n)), while there is a 
distribution \x where any rectangle with constant error has size 



5 Conclusions 



Virtually allg known lower bounds on randomized communication com- 
plexity in the literature can be seen as instances of methods 1, 2, or 3. 
We have shown that these three methods have exponential differences in 
power. It remains open whether method 1 is polynomially tight for ran- 
domized communication complexity. A way to show this is proving that 
the logarithm of the size of bounded error uniform threshold covers is 
polynomially related to the randomized communication complexity. This 
avoids arguing with the optimum (over all balanced distributions) of the 
lower bound parameter. We have shown in Theorem |2| that arguing for 
all distributions separately does not yield the desired result. 

Methods 2 and 3 have been characterized as more powerful versions 
of threshold covers. It is interesting that the rectangle based lower bound 
proofs can be understood in terms of these combinatorial objects that 
are only in the case of method 2 known to be directly related to standard 
communication complexity modes. 



2 A recent exception is a fl(^/n ) low er bound on the information which must be ex- 
changed in computing DISJ [ 5S02[ ] (and hence on the communication co mplex ity) . 



This quantity can also be lower bounded using the rectangle method, see K102] 
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We now note some further observations and open problems. The most 
significant open problem related to this paper is whether UT S; 2 S (f) and 
R(f) are polynomially related, resp. whether R(UT) < polyilogN). 

It can be shown with techniques as in [|HHT97| that every / with 
APP(f) = poly(logn) is in the polylog-communication complexity poly- 
nomial hierarchy, see |BFS86| for a definition of the latter. It is im- 
probable, however, that the same holds for all functions with PP(f) = 
poly (log n), since then the communication complexity version of the poly- 
nomial hierarchy would collapse. Let us note that the separation of the 
polynomial hierarchy for communication complexity is open. Hence method 
3 allows to show that some explicit function is not contained in the class 
of problems with APP(f) = poly(logn), the largest class of problems 
inside the polynomial hierarchy for which such a lower bound is known. 
Showing that this class is a proper subset of the hierarchy is open. It is 
also open, which methods might be applied to separate this hierarchy. 

Regarding the fine-structure of the relations between the discussed 
complexity measures there are several open problems. Is it possible to 
separate AM- from MA-complexity? This could be done by using the 
rectangle method to separate AM(f) from UT^gif)- But it is also possi- 
ble that UT^sif) is always polynomially related to AM(f). Furthermore 
any lower bounds for AM communication complexity are desirable, since 
they would probably need new techniques and lead to progress on the 
problem of showing lower bounds for higher classes in the polynomial hi- 
erarchy. Also a separation of MA(f) from UT^sif) would be interesting. 

As another issue the role of interaction in communication complex- 
ity is interesting. For nondeterministic communication 1-round protocols 
are optimal, not so for randomized, deterministic, and even communi- 
cation with limited nondeterminism | KN97 , |K198|1 . Clearly 1-round AM- 
protocols are also optimal, but this seems unlikely for MA-protocols. A 
candidate problem to establish this conjecture would be the majority 
of the outcomes of pointer jumping on yjn paths of length k, with the 
promise that 3/4- ^Jn paths lead to the same output. A randomized proto- 
col with k rounds and 0(k\ogn) communication can solve this problem, 
but M^-protocols of complexity o(^/n) using k — 1 rounds possibly not. 
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A Organization of the rest of the paper 

In appendix B we formally define the different modes of communica- 
tion complexity considered in this paper, appendix C provides proofs of 
elementary properties of the rectangle bound. In appendix D we give 
the proofs concerning the comparison of the rectangle bound with MA- 
and AM-communication complexity, and the proof of Theorem [2|. Ap- 
pendix E shows the equivalence between bound(f) and bounded error 
uniform threshold covers for /. Appendix F shows equivalence between 
lower bound method 3 and approximate majority covers, and gives sepa- 
rations between the three lower bounds methods resp. the three types of 
threshold covers. 



B Definitions 

We employ the following definitions of communication complexity. 

Definition 8. Let f : {0, l} n x {0, l} n -> {0, 1} be a function. In a 
communication protocol players Alice and Bob receive inputs x, y from 
{0, l} n each. Their goal is to compute f(x,y). To this end the players 
exchange binary encoded messages. The communication complexity of a 
protocol is the worst case number of bits exchanged. 

The deterministic communication complexity D(f) of a function f is 
the complexity of an optimal protocol computing f . 

In a nondeterministic protocol for a Boolean function f the players are 
allowed to guess some bits and communicate according to a different de- 
terministic strategy for each guess string. An input is accepted iff at least 
one computation accepts. The nondeterministic guesses are private and 
accessible to the guessing player only. The nondeterministic communica- 
tion complexity N(f) is the complexity of an optimal nondeterministic 
protocol computing f. 

In a randomized protocol for a Boolean function f the players can 
access a public source of random bits. They can communicate according 
to a different deterministic strategy for each value of the random bits. It is 
required that for each input the correct output is produced with probability 
1 — e for some e < 1/2. The randomized communication complexity R e (f) 
is the complexity of an optimal randomized protocol computing f with 
error probability e. 

Arthur Merlin computations have been introduced in j B8t , B~M8Sd . In 



an Arthur Merlin (AM) protocol the players may first access a public 
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source of random bits and read an arbitrarily long random string. Af- 
ter this phase they start a nondeterministic protocol. A function f : 
{0, l} n x {0, l} n — > {0,1} is computed if for all x,y with f(x,y) = 1 
with probability 1 — e over the random bits the nondeterministic protocol 
accepts (i.e., there is a guess string makes players accept), while for all 
x,y with f(x,y) = with probability 1 — e over the random bits the non- 
deterministic protocol does not accept (i.e., there is no guess that makes 
the players accept). The communication complexity of an Arthur Merlin 
protocol is the maximum (over the random bits) of the complexities of 
the nondeterministic protocols. Let AM e (f) denote the complexity of an 
optimal Arthur Merlin protocol for f with error e. 

In a Merlin Arthur protocol for a function f : {0, l} n x {0, l} n — > 
{0, 1} the players first make a nondeterministic guess of some length k 
known to both players. Then the players perform a randomized protocol. 
It is required that for all x,y with f(x,y) = 1 there is a value of the 
guess, so that the protocol accepts with probability 1 — e, while for all 
x,y with f(x,y) = there is no value of the guess, so that the protocol 
accepts with probability larger than e. The complexity of a Merlin Arthur 
protocol is given by the maximum complexity of the communication ( over 
the guesses and the coin tosses) plus k. Let MA e (f) denote the complexity 
of an optimal Merlin Arthur protocol for f with error e. 

In case the subscript fixing the error is dropped we set the error to 
1/4. 

Note that an Arthur Merlin protocol is an interactive proof system 
with verification performed by a communication protocol. Arthur chal- 
lenges Merlin to provide a proof that f(x,y) = 1, this proof is verified by 
Alice and Bob. 

A Merlin Arthur protocol uses a randomized protocol to check a fixed 
proof, whose length is included in the communication cost. The Mer- 
lin Arthur model would be ill-defined, if we would simply require the 
nondeterministic guess to be private and coming without cost. In this 
case Alice could simply guess Bob's input nondeterministically, and then 
use a randomized protocol for the equality function with O(l) commu- 
nication to test if her guess was rig ht (see Example 3.13 in [|KN97| |, 
note that public coin in the randomized protocol). If so, she can com- 
pute any function on x, y and announce the result. Hence any function 
/ : {0,l} n x {0,l} n — ► {0,1} would have Merlin Arthur communication 
complexity O(l) under such a definition. 

In lower bound proofs for randomized complexity one often applies 
the Yao-principle that states a relation between the complexity in the 
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randomized setting (with small error probability for every input) and 
the complexity in the deterministic setting where correctness is only de- 
manded with high probability over some distribution on the inputs. 

Definition 9. A deterministic protocol has error e under some distribu- 
tion \x on the inputs, if the probability that the protocol errs is e. 

The distributional deterministic complexity of f : {0, 1}™ x {0, 1}™ — > 
{0, 1} is D^{f), the minimal complexity of any deterministic protocol with 
error e under \x. 

A nondeterministic protocol has error at most e under some distribu- 
tion fx on the inputs, if the probability (under //) of the set of accepted 
0-inputs and nonaccepted 1-inputs is at most e. 

The distributional nondeterministic complexity of f : {0, l} n x{0, l} n - 
{0, 1} is N^(f), the minimal complexity of any nondeterministic protocol 
with error e under /i. 

Nondeterministic communication complexity is related to a specific 
type of covers fKN97j . 

Fact 5 Let Cov^\f) denote the minimum number of monochromatic (0- 
error) 1-rectangles in a set {R±, . . . ,R C }, so that / _1 (1) = Uf =1 i?j ^ 0. 
Then N{f) = [log Cov^ (/)] . 

C Properties of the rectangle bound 

Proof of Lemma |l]. Assume bound^\f ) = k. Hence for all balanced 
distributions there is a 1 — e-correct 1-rectangle of size l/2 k at least, t IX 
any balanced distribution /i. We construct a rectangle of error e l and size 
2-0(ki) £ Qr ^ i nc l uc tivery. 

Let [Iq = ji. First we take a rectangle R\ with error < e and size 
si > l/2 k guaranteed by our assumption for hq. In case Ri has no error at 
all we are done. Otherwise we construct a new distribution /xi as follows: 
For all x,y R\ we set ^i(x,y) = 0. /ii is then normalized to a strictly 
balanced distribution by multiplying no(x,y) by some factor po when 
f(x,y) = and multiplying fio(x,y) by a factor p\ when f{x,y) = 1. 

Note that since //(/ _1 (0)|i?) < e we have p > l/(2e) and 1/(2(1 - 
e )) > Pi > 1/2. Then we can pick a rectangle R2 with error e and size 
S2 > l/2 fc according to 

Now we compute the size of R2 according to fi and also its error on 
ji. By concentrating [i\ on R\ we have increased the weights of x, y E R\ 
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uniformly by a factor of (1/si) < 2 fc . Then we have balanced the distri- 
bution by multiplying 1-inputs' weights with p\ and 0-inputs' weights by 
Po- 

So the weight of 0-inputs in i? 2 according to \i is at most 

s\ ■ (1/po) • e • s 2 < si • 2e 2 • s 2 . 

The weight of the 1-inputs is at least 

si • (l/p x ) • (1 - e) • s 2 > si • 2(1 - e) 2 • s 2 . 

The size of i? 2 is at least si • s 2 > l/2 2fc . 

Assume that e > 1/4, then e = 1/2 — 5 for some (5 < 1/4. In this case 

e-(iW, 1) < + < i " I7 ^J < 1/2 - (3/2)* 

Repeating this 0(1/5) = 0(1) times reduces the error to less than 1/4. 

If e < 1/4, then R2 has error at most 2e 2 . Iterating the construction 
0(1) times yields the first part of the lemma. Arguing analogously for 
O-rectangles yields the second part. □ 

Proof of Lemma pj. Let the bound be k when pi runs over all a- 
balanced distributions on the inputs. 

First assume that a < 1/4. Clearly k is an upper bound on k' = 
boundS^ '(/) in this case. We have to show that also k = 0(k'). 

Let n be an a-balanced distribution on the inputs. We can balance 
the weights of 1-inputs and 0-inputs by multiplying the weights of the 
0-inputs by some pq and the weights of the 1-inputs by some p% so that 
a strictly balanced distribution fj,' is obtained. We take a rectangle R of 
error e and size l/2 k according to //. Assume that a < /i(/ _1 (l)) < 1/2. 
Then p\ > 1. Consequently the size of R is slightly smaller according to 
\i than to //, and the error is possibly slightly smaller, too. In the other 
case a < ^(/~ 1 (0)) < 1/2 the opposite occurs, namely the rectangle is 
possibly slightly larger, and the error is larger. But in any case the size 
and the error are changed by constant factors 0(a) resp. 0(1/ a) only. If 
the error of R is too large we can use the previous lemma to reduce the 
error probability while decreasing the size. 

The case l/4<a<l/2is handled similarly. □ 

D The rectangle bound versus Arthur Merlin 
communication complexity 

We now prove the relations between Arthur Merlin communication and 
the lower bound method based on rectangle size. 
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Proof of Theorem [l], part 1. We are given a Merlin Arthur proto- 
col with complexity c and error e < 1/4 for the function /. We show that 
under this condition we can find a large 1 — e-correct 1-rectangle. Recall 
that the complexity of the Merlin Arthur protocol includes the "commu- 
nication" done by Merlin (who is guessing nondeterministically) plus the 
communication by the players Alice and Bob. Let cm be the length of the 
longest guess given by Merlin over all inputs. We will call such a guess 
string a proof. Let cp denote the length of the longest communication 
between Alice and Bob occuring during any run of the protocol. Clearly 

CM,Cp < C. 

It is possible to reduce the error probability of the protocol to l/2 2c 
by repeating the probabilistic part of the protocol O(c) times indepen- 
dently and taking the majority output, for any fixed proof of Merlin. Let 
P(x, y, z) denote the (random) output of the protocol for inputs x, y and 
Merlin's proof z. The acceptance properties of the protocol are then: 

If f(x,y) = 1 then there is a proof z so that P(x,y,z) accepts with 
probability 1 - l/2 2c . 

If f(x, y) = then for all proofs z, P(x, y, z) accepts with probability 
at most l/2 2c . 

Note that the communication among the players in the new protocol 
is bounded by k = 0(c ■ cp) = 0(c 2 ). 

There are at most 2 CJU different proofs. If we fix such a proof z there 
is a set s z of 1-inputs that is accepted on this proof, i.e., for which the 
protocol accepts with high probability on this proof. In this way the set 
of 1-inputs is covered by 2 CM subsets s%, . . . , S2 c m ■ 

Let fj, be any balanced distribution over the inputs. For each such 
distribution we can find at least one proof z so that fi(s z ) > 1/2 CM . We 
fix such a proof. This turns the Merlin Arthur protocol into a random- 
ized protocol so that a subset s z of 1-inputs of weight 1/2 CM is accepted 
with probability 1 — l/2 2c each and no 0-input is accepted with proba- 
bility larger than l/2 2c . The other 1-inputs are accepted with uncertain 
probability. 

We restrict fi to the inputs in s z U / _1 (0), by setting the weight of all 
other inputs to and normalizing to a distribution yl '. Clearly the error 
of the protocol under // is at most l/2 2c . Furthermore note that either 

fjf(x, y) = or //(x, y) = 9(n{x, y)), (1) 

since y is balanced. 

Given such a randomized protocol we may also fix its random choices 
and get a deterministic protocol (like in the easy direction of the Yao- 
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principle). This yields a deterministic protocol which on expectation has 
error l/2 2c (under //). Consequently there exists a deterministic protocol 
with error l/2 2c under // and communication k = 0(c 2 ). 

A deterministic protocol with communication k easily leads to a set R 
of P = 2°( fc ) pairwise disjoint rectangles labeled with the protocol output 
that partition the communication matrix. We show that there exists a 
large 1-rectangle with small error. 

Assume that all rectangles which are larger than 1/(2 C+1 • 2P) have 
error larger than 1/2 C . Then the success probability of the protocol on 
// is upper bounded as follows. The small rectangles contribute at most 
P- 1/(2 C+1 -2P) ■ 1 < 1/2 C+2 , the large rectangles all have success at most 
1 — 1 /2 C and so the overall success probability is at most 1 — 1/2 C + 1/2 C+2 , 
too small in comparison to the maximum error l/2 2c . Hence there is a 
1-rectangle of size 1/(2 C+1 • 2P) > 2-°^ with error at most 1/2 C < e 
according to //. If we switch from // to fj,, then the size of a rectangle 
cannot decrease (compared to /i) by more than a constant factor due to 
(H). It is also easy to see that the error of the rectangle cannot increase 
when switching from // to /i. Consequently bounds (/) < 0(c 2 ). □ 

Now we relate bound^{f) to AM(f). 

Proof of Theorem [I], part 2. Assume that bound^p (f) = c. Then 
bound^ 8 < 0(c). For all balanced distributions /i there is a rectangle 

with error at most e 4 /8 and size at least s > l/2°( c \ Also recall 
that AM e (f) = max^JV^/), where /i runs over all distribution on the 
inputs, due to Lemma |||. We use a greedy algorithm to construct a cover 
of the 1-inputs to / with error e containing at most 2°^ ■ (1/e 2 ) -log(l/e) 
rectangles for any [x. 

So let [i be some distribution on {0, l} n x {0, l} n . We distinguish 
three cases. First consider the case that \i(j- x {\)) < e 2 . In this case 
clearly N^if) = by a protocol that never accepts. 

Next consider the case that /x(/ _1 (0)) < e 2 . In this case N^if) = 
by a protocol that always accepts. 

Now consider the case that //(/ _1 (1)) > e 2 and ^(/ _1 (0)) > e 2 . Then 
we can still find a good cover as follows. We first show that for each such 
distribution a relatively large rectangle with small error exists. Then we 
use a greedy approach to find a cover. 

First we show how to find good rectangles. We (strictly) balance the 
distribution by multiplying the weights of 1-inputs by some value pi and 
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multiplying the weights of 0- inputs by some value pq. Clearly 

2(l- e 2)>l ; l> 2e 2 . 
Pi Po 

For the resulting strictly balanced distribution fj there is a 1-rectangle 
R^i of size s having error e 4 /8 at most. Then the fi- weight of 1-inputs in 
R^i is at least 

s ■ (1 - e 4 /8) • (1/pi) > a • (1 - e 4 /8) ■ 2e 2 > se 2 . 

Furthermore the ^-weight of 0-inputs in R^i is at most 

8 • (e 4 /8) • (1/po) < 8 • e 4 • (1 - e 2 )/4 < se 4 /4. 

We next construct for fi a cover using a greedy approach. 

1. Let = n- 

2. For ^ find a rectangle i?« = that contains 1-inputs of weight at 
least se 2 and 0-inputs of weight at most se 4 /4. 

3. Put Ri into the cover. 

4. Remove the weight from all 1-inputs in Ri and uniformly increase the 
weights of the remaining 1-inputs by some appropriate factor q(i). 
[Note that this does not affect the balance of the distribution.] Let 

denote the resulting distribution. 

5. Stop, when the set of remaining 1-inputs not covered so far has weight 
< e/2 according to \i. 

6. Otherwise continue with 2. and set i := i + 1. 

Clearly the algorithm finds a set of rectangles so that all but a set 
of weight e/2 of the 1-inputs is covered. In the worst case the weight 
of any 1-input is increased by a factor of ^(l) ■ • ■ q(i) < (3/4)/ (e/2) = 
(3/2) /e during the course of the algorithm. Hence the weight of 1-inputs 
in Ri according to /x is at least se 2 • e • (2/3), while the weight of 0- 
inputs is at most se 4 /4. The error of Ri is thus at most (3/8)e. Since this 
holds for all rectangles, the weight of 0-inputs in the cover is at most a 
fraction of (3/8) e of the weight of all 1-inputs covered, which is at most 
(3/4) • (3/8)e < e/2. The weight of 1-inputs not covered is e/2. So the 
obtained cover has error e. 

Now we have to analyze the size of the obtained cover. Each step covers 
at least a se 2 fraction of the remaining 1-inputs. Hence the proportion of 
not yet covered 1-inputs according to \i after k steps is 

w k <(l- se 2 ) k . 
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The algorithm stops if this is smaller than O(e), hence k < 0(l/(se 2 ) ■ 
log(l/e)), and N? (/) < 0(c + log(l/e)). 

So indeed for all fj. we have iVf(/) < 0{bound { eHf) + log(l/e)), and 
hence ^M £ < 0(6o«n^ 1} (/) + log(l/e)). □ 

Proof of Theorem ||]. We first define the function that has both small 
nondeterministic and co-nondeterministic complexity with error under 
some distribution, but large deterministic complexity for some constant 
error under the same distribution. 

Let WHICH((x 1 ,x 2 ),(y 1 ,y 2 )) = 

( 1 if ^DISJ{ Xl , y{) = 1 and ^DISJ(x 2 , y 2 ) = 
\ if -,DISJ{x u yi) = and -^DISJ(x 2 , jte) = 1 
I otherwise. 

We employ the following more specific and optimized version of Fact |, 
which follows from some fine-tuning of the result in |R92f| . 

Fact 6 Let u a be the distribution on {0, l} n x {0, l} n which is uniform on 
{(x,y) : \x\ = \y\ = n/4,\x n y\ = 0} (and elsewhere), and let v r be the 
distribution on {0, l} n x {0, l} n which is uniform on {(x,y) : \x\ = \y\ = 
n/4, |xny| = 1} (and elsewhere). Let u be the distribution on {0, l} n x 
{0, l} n , which is defined by v(x, y) = (3/4) • v a (x, y) + (1/4) • v r (x, y). 

Then for any constant 5 > there is a constant /3(d) > 0, so that any 
rectangle R either has size 2~@^ n , or v(x,y : x D y 7^ | R) > 1/4 — 5, 
i.e., 

u({(x, y) : x n y + 0} n R) > (1/4 - 5) ■ v{R) - 2~^ n . 

Now to the definition of the distribution on the inputs. In the distri- 
bution v x u two instances {x\,yi) and (x 2 ,y 2 ) are chosen independently 
from v. 

For the hard distribution /i on inputs we pick inputs as in u x v, 
but inputs with DISJ(x\,yi) = DISJ(x 2 ,y 2 ) are removed, so that the 
function value on the two instances differs with probability 1 (// is nor- 
malized to a distribution after this removal). On \i the task of a protocol 
is to determine on which of the two set pairs -^DISJ is true. Note that 
either fJ,(x 1 ,x 2 ,y 1 ,y 2 ) = Q{y x u(x 1 ,x 2 ,y 1 ,y 2 )) = &(u(xi,y{) ■ v(x 2 ,y 2 )) 
or /j,(xi,x 2 ,yi,y 2 ) = 0, for all inputs (xi, x 2 , yi, y 2 ) from {0, l} 4n , since 
Prob v (DISJ(x\,yi) = 1) = 3/4. Also note that /i is strictly balanced. 

There is a simple nondeterministic protocol for WHICH making no 
error under the distribution /i. One can simply use a protocol for -^DISJ 
on the first instance. This covers all 1-inputs of WHICH , but accepts 
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no 0-input with weight larger than 0. Analogously we can find a protocol 
for ^WHICH under p. So (WHICH), Ng (WHICH) < logn + 1. 

Now we turn to the complexity of a deterministic protocol with error. 
Such a protocol with communication c immediately yields a set of P = 

2 0(c) 

pairwise disjoint rectangles labeled with values 0,1, so that a 1 — e 
fraction of all inputs according to fx are in correctly labeled rectangles. 
Call the 1-rectangles R\, . . . ,Rp, the 0-rectangles S\, . . . ,Sp. 

We show that such a partition can only exist if c = fi(n). So for 
the sake of contradiction assume that c < for some arbitrarily small 
constant 7 we can choose later. 

Note that the difficulty for a deterministic protocol is that the cor- 
responding cover consists of disjoint rectangles, even on the inputs with 
fj,(x±, x 2 , y±, 2/2) = 0. In case the reader would prefer to loosen this re- 
striction and require a protocol to be deterministic only on those inputs 
with n(xi, x 2 , 1/2) > we can still give those inputs some small prob- 
ability, so that the above nondeterministic protocols would have small 
error, while the following lower bound on deterministic protocols would 
be unchanged. 

We use the following notation. An input (x±, X2, yi, 2/2) is in quad- 
rant A, if ^DISJ(x\,y\) = and ~^DISJ(x2,y2) = 0, in quadrant 
B, if ^DISJ(x\,yi) = 1 and ~^DISJ(x2,y2) = 0, in quadrant C, if 
-^DISJ(xi,yi) = and -^DISJ(x2,y2) = 1, and in quadrant D, if 
-^DISJ(xi,yi) = 1 and ~^DISJ(x2,y2) = 1- Note that inputs in quad- 
rants A and D have probability under \x. Under u x u quadrants B and 
C have weight 3/16, quadrant A has weight 9/16 and quadrant D has 
weight 1/16. 

Let Ri be a 1-rectangle and (xi,x 2 ,yi,y2) £ R% with ->DISJ(xi,yi) = 
1. Then the set Ri(x±,yi) = {x2,y2 ■ x±,x 2 ,yi,y2 £ Ri} is a rectangle in 
{0,1}" x {0,l} n . Let 



n(R i (x 1 ,y 1 ) I x 1 ,y 1 ) 



v({{xi,yi)} x Ri(xi,yi)) 
M ({(xi,yi)}x{0,l}-x{0,l}-) 



denote the weight of Ri(x±,yi) relative to the inputs with fixed x\,y\. 
Note that 

M({(*i,j/i)} x {0,1}™ x {0,1}") = 9{y{x 1 ,y 1 )). 

Also 

t*({(xi,yi)} x Ri(xi,yi)) < 0(u(xi,yi) ■ v{R i {x 1 ,y 1 ))), 
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since all inputs xi,X2,yi,?/2 with ->DISJ(x2,y2) = 1 have weight in \x 
and all x\,X2,y\,y2 with ->DISJ(x2, 2/2) = have weight fJ,(x±, X2, yi, 2/2) = 
®(y{xi,y x ) ■ u(x 2 , y 2 ))- Then 

u(R i (x 1 ,y 1 )) = f2(fj,(R i (x 1 ,y 1 ) \ x 1 ,y 1 )). (2) 

We will show that each large Ri(x\,y\) must contain many inputs 
X2-,y2 with ->DISJ(x2,y2) = 1- While this does not create any error in 
the rectangle Ri, this shows that Ri occupies a significant portion of quad- 
rant D. If this is true for many rectangles Ri then a situation is reached 
in which the majority of quadrant D is occupied. Since a symmetric ar- 
gument applies to the O-rectangles we are lead into a contradiction, since 
the rectangles are not allowed to intersect nontrivially. 

We set e = 5/2 = 1/17 fixing the protocol's error and the con- 
stant from Fact ^, and choose 7 < (3(5) /2. Let xi,yi be an input with 
^DISJ(x uyi ) = 1. If M(fli(zi,Vi)Kyi) > ^(2~ 7n ), then with (|) 
v(Ri(xi,yi)) > 2~~@^ n , and hence the fraction of ^2,2/2 £ R%(xi,yi) with 
->DIS J(x2i 2/2) = 1 is at least 1/4 — 5 according to v. In other words, 
the proportion of 0-inputs to 1-inputs of -iDISJ(x2,y2) in Ri(x\,y\) is 
3/4 + 5 to 1/4 -5. 

Hence {(x\, y%)} x Ri(x\,yi) occupies at least a (1/4 — <5)/(3/4 + <5) > 
1/3 — 2(5 fraction of the weight it covers in quadrant B (according to v x v) 
also on the inputs in quadrant D (according to v x v). Recall that onyxj/ 
quadrant D has weight 1/16 and quadrant B has weight 3/16. 

We next show that a fraction of 1 — 2e of all 1-inputs of WHICH 
(resp. of quadrant B) is covered by Ri(xi,yi) with (J,(Ri(xi,yi)\xi,yi) > 
Q{e ■ 2~T n ) and hence v{Ri{x\,y{f) > 2~^ n (using (|)). Then in quad- 
rant D weight at least (l-2e)-(l/3-2<5)-3/16 > 1/16-5/2 > (l/2)-(l/16) 
is occupied by the rectangles Ri (according to v X v). Since the same 
holds for the rectangles Si we get at least one input in quadrant D that 
is covered twice, a contradiction to the requirements on those rectangles. 
Therefore 7 > (3(5)/ 2 = 0(1) and hence c = Q(n). 

The weight of 1-inputs (according to n) that are in rectangles Ri(x\, y\) 
with fi(Ri(xi,yi)\x\,yi) > e / const ■ 2~ in (and ->DISJ(xi,yi) = 1) 
is at least 1 — 2e, since at most an e fraction of 1-inputs is not cov- 
ered and the fraction of 1-inputs covered by smaller rectangles can be 
bounded as follows. Each small rectangle Ri(x\,yx) can cover 1-inputs 
of weight at most e/const ■ 2~~ in ■ ix({(x\,y\)} x {0, l} n x {0,1}™), and 
there are at most 2 7n • S different Ri(x\,y±) with ->DISJ(xi,yi) = 
1, where S = \{(x,y) : \x\ = \y\ = n/4, \x fl y\ = 1}\. Note that 
//({(xi,yi)} x {0, l} n x {0, l} n ) = 0(1/ S). So the 1-inputs covered by 
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small rectangles have weight at most 



e/const-2-^ n ■ fx{{{ Xl , yi )} x {0, l} n x {0, l} n ) • 2^ n • S < e. □ 

E Uniform threshold covers and the rectangle bound 

Proof of Theorem |||, part 1. Assume that C/T S 2 S (/) — k for some 

s. Then following Remark [l], UT^) s ,(f) < O(k) for an arbitrarily small 
constant e and some s'. 

Given a one-sided bounded error uniform threshold cover with P < 
2°( k ) 1-rectangles S = {R\, ■ ■ ■ , Rp} let h(x,y) denote the number of 
rectangles Ri the input x,y is included in. 

We know that for each x,y with f(x,y) = 1 there are at least s' 1- 
rectangles it is included in, so h(x,y) > s'. Each x,y with f(x,y) = is 
in at most es' 1-rectangles, hence h(x,y) < es' . 

Let \x be any balanced distribution on the inputs. We define a prob- 
ability distribution v on the 1-rectangles in S as follows. Each rectan- 
gle it! € S receives the weight J2 x ,y&R ^{ x iV)- We then normalize these 
weights to a distribution on 1-rectangles in S. The probability of some 
rectangle S is then 

X ^ R Ejy Ex'yei?' K x ', V') Ex'ye{o,i}«x{o,i}« v( x ', V') ' H x ', V 1 ) ' 

If we first pick a rectangle according to u and then on that rectangle 
an input (according to fj, restricted to R), we get some input x,y with 
probability 

v{x,y) ■ H x ,y) 

Ex'ye{o,i}™x{o,i}™ v( x ', v') ■ K x 'i y') ' 

So the weight of x, y in this experiment is proportional to fj,(x, y) ■ 
h(x,y). Hence the probability of picking a 0-input in this way is at most 

MCQ0)W < 3/4 -e^ 

MZ-Hi))^' - i/4 -s> - 3e ' {S) 

Assume that all rectangles that are larger than e 2 /P according to \i 
have error larger than 4e. Then, if we first pick a rectangle R and then 
an input x,y € R the probability that f(x,y) = 1 can be bounded as 
follows. The small rectangles contribute at most P ■ (e 2 /P) • 1 < e 2 to this 
probability. All larger rectangles have error 4e at least, and hence when 
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picking one of them the probability of getting a 1-input is at most 1 — 4e, 
so the overall probability of getting a 1-input is at most 1 — 4e + e 2 , a 
contradiction to (||). 

Hence there exists a 1-rectangle of size at least e 2 / P = Q(l/P) having 
error at most 0(e) according to u. 

When given a bounded error uniform threshold cover we can do the 
same construction for the 0-inputs, and hence UT s ^ s (f) = k allows us to 
find both a 1-rectangle and a O-rectangle with the desired properties for 
any balanced u. □ 

Proof of Theorem |3|, part 2. 

Assume bound^f) = h for some / : {0, l} n x {0, l} n -> {0,1}. 
Then boundfK(f) < 0(Hog n) using Lemma [l|. In other words, for each 

balanced distribution ji on {0, l} n x {0, l} n there exists a 1 — l/n 5 -correct 
1-rectangle of size at least s = 2~ ( fclo s ra ). 

We show how to construct a one-sided uniform threshold cover with 
parameters n,n 2 . The cover is produced by an algorithm. Let ui be the 
distribution which is uniform on the 1-inputs of / with probability 1/2 
and uniform on the 0-inputs of / with probability 1/2. 

1. Set I = 1, Cov = 0. 

2. Find a 1 — l/n 5 -correct rectangle Ri of size at least s according to n; 
and let Covi = Covi-\ U {Ri}. 

3. Let Io(l) denote the set of 0-inputs in Ri, let I±(l) denote the set of 
1-inputs in R[. 

4. Construct fi[ + \ be as follows: 

— the weight of all inputs in is reduced by a factor of 1 — 1/n 4 . 
The obtained "free" weight is used to increase the weights of all 
inputs in Io(l) by a fixed factor. 

— Any input in that is covered more than n 2 times receives 
weight 0. Its weight is used to increase the weight of all 1-inputs 
by a fixed factor. 

5. STOP if all 1-inputs are covered at least n 2 times. 

Note that each 0-input that is covered in some iteration I at least 
doubles its weight at that point. Namely, since a rectangle Ri has error 
1/n 5 , weight fJ-i(Ri) • (1 — 1/n 5 ) • 1/n 4 is distributed to inputs in Iq(1) of 
weight fii(Ri) • 1/n 5 , more than doubling the weight of each such input. 
Since no 0-input has weight more than 1 or less than l/2 2ra this implies 
that no 0-input is ever covered more than 2n times. 

We have to show that the distributions fj,i are all balanced for step 
2. to work. The second part of step 4. does not change the balancedness 
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of the distribution. In the first part of step 4. some weight is shifted from 
1-inputs to 0-inputs. The inputs in are reduced in weight by a factor 
of (1 — 1/n 4 ). But as soon as an input is covered n 2 times this reductions 
stops. 

Let us assume for the moment the following lemma justifying step 2., 
whose proof will be provided at the end of this section. 

Lemma 4. 1/2 > > 1/2 - 0(1/ n) for all I. 

The lemma clearly implies that the distributions m are all balanced, 
hence step 2. is applied correctly. We use Lemma |] only to ensure that the 
obtained cover is good on the 0-inputs, but not to ensure that it is good 
on the 1-inputs. To analyze the number of iterations of the algorithm we 
consider the following modification of step 2. 

2.' First strictly balance fii by uniformly increasing the weights of 1- 
inputs and decreasing the weights of 0-inputs by fixed factors. Then 
pick a size s rectangle Ri with error 1/n 5 according to that distribu- 
tion. 

The size of Ri on m is at least s ■ di, when di = u;(/ _1 (l))/(l/2) 
denotes the distortion of the balance of \i\ compared to Note that 
< 1/2, so d l < 1. 

We will show that the modified algorithm terminates and produces a 
size <9(n 3 /s) cover. This immediately implies that each 1-input is covered 
at least n 2 times. Then this is also true for the original algorithm: The 
original algorithm uses larger rectangles in step 2. and hence terminates 
faster. Furthermore we will show that Lemma ^ holds for both the mod- 
ified and the original algorithm. Note that, however, only the original 
algorithm guarantees that the cover is good on the 0-inputs. 

Let Si denote the set of 1-inputs not covered n 2 times before iteration 
I, and let Ni = \Si\. Since the weight of each input in Si is reduced in the 
first part of step 4. by a factor of (1 — 1/n ) each time it is covered, this 
decreases the weig ht of such an input by (l-l/n 4 ) n - 1 < (1 - 0(l/n 2 )). 
In the second part of step 4. the weights of all 1-inputs are increased by 
some fixed factor. Furthermore fj,\(x,y) = 1/(2N\) for all 1-inputs x,y. 
Hence for all x, y E Si and x' ,y' £ Si: 

(1 - 0(l/n 2 )) • fiiix', y') < w (x, y) < (1 + 0(l/n 2 )) • y'). 

The average weight of an input in Si is (l/(2iV/)) • d[. Hence the for all 
x,y e Si 

(l/(2Ni))di(l - 0(l/n 2 )) < fii(x,y) < (l/(2JV,))d,(l + 0(l/n 2 )). (4) 
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Let hi(x,y) denote the number of times input x,y is covered by 
Covi-i. There are N\ 1-inputs to /. Then let 

h(l)= V^l • Pos(n 2 - hfay)), 

(i,2/)e/- 1 (i) 

where pos(x) = x if x > and pos(x) = otherwise. h{l) denotes the 
average number of times 1-inputs still have to be covered. Clearly n 2 > 
h(l) > 0, and if h{l) > 0, then h(l) > 1/iVi. 

In each step 1-inputs of weight s(l — l/n 5 )di according to /U/ are cov- 
ered. Let C[ denote the set of 1-inputs x,y £ Ri with hi(x,y) < n 2 . Due 
to (|): mi(Cz) > sdi(l - 1/n 5 ) • (1 - 0(l/n 2 )) ■ (l/d^/Nx > a/2 ■ Ni/N x . 
Including Ri in the cover reduces h(l) by at least sNi/N\ hence. Then 

h(l + 1) < h{l) - sNi/Nt 

= !M • p° s ( n2 - hi?, v)) - WNi) ■ s 

{x,y)ef-l{l) 

= Wi > 2 -/i/(x,y)-s) 

< l/N 1 -{n 2 -hi(x,y))-(l-s/n 2 ) 

< h(l) ■ (1 - s/n 2 ). 

For some I = 0(n 3 /s) iterations h(l) = 0. Hence the constructed cover 
contains no more than 0(n 3 /s) rectangles. Since the algorithm with the 
original step 2. terminates at least as fast we have UT^ 1 \(f) < 0(A;logn). 

Given that UT s>t {f) = max{UT$ (f), C/T^J (-./)} we can simply do 
the same construction for the 0-inputs and get the desired result for 
bound(f). □ 

Proof of Lemma ^. First let us look at the algorithm with the 
modified step 2'. Let sidi denote the weight of Ri in /i/. As argued before, 
the algorithm stops as soon as JJ(1 — si/n 2 ) < l/(n 2 • N{). So there is a 
sequence s±, . . . , s/. so that Y\f =1 (l—si/n 2 ) < l/(n 2 -N\) and all si > s, and 
k is minimal with this property. The weight transferred to the 0-inputs is 
then at most J2t=i sidi/n 4 < J2t=i s il nA \ since no weight is transferred 
from We may hence adjust Sk so that nf=i(l — si/n 2 ) = l/(n 2 • JVi). 

For each k it is true that Ya=i s i i s maximized for s± = ■ ■ ■ = =■ s 
because: let s[ = (1 — si/n 2 ). Then FJ s[ = \/{n 2 Ni) and we want to 
maximize ^Z(l — s[n 2 ) = k — n 2 s i or equivalently minimize sj. This 
is achieved when s\ = ■ ■ ■ = Sk- 



29 



Then k = 0(n 3 /s) and the transferred weight is at most k • sjrft = 
0(l/n). Consequently the same holds if k is arbitrary. Note that this 
implies di > 1 — 0(l/n). 

In case the original step 2. of the algorithm is applied fii(Ri) = s; > s, 
and potentially a larger weight is transferred to the 0-inputs. But this 
also makes the algorithm terminate quicker. 

The algorithm stops at least when ]J(l — s//(^n 2 )) < l/(n 2 • N\). We 
may substitute s'{ = si/di and are left with the problem of finding the 
maximum of s 'i'di under the constraint that FJ(1 — s'{ /n 2 ) = l/(n 2 • N\) 
and s'l = si/di > s/di > s. This is the problem we have just analyzed. □ 

F Comparing the power of different threshold covers 

First let us show that one-sided bounded error uniform threshold covers 
for some function / can easily be converted into approximate majority 
covers. The same also holds for the complement of /. 

Proof of Theorem Assume that UT^ l Q t {f) = k. Then there ex- 
ist 2 k rectangles so that each 1-input is in at least 9t rectangles and 
each 0-input is in at most t rectangles. Now label all the rectangles as 
1-rectangles and add 3t times the 0-labeled rectangle covering all in- 
puts. This is clearly an approximate majority cover, hence APP(f) < 
UT t %(f) < 0(UT t %(f)). □ 

Now we relate APP(f) to a version of the rectangle size bound. 

Proof of Theorem Assume that APP(f) = k, then we can find 
2 k labeled rectangles making up an approximate majority cover for /. 
We first have to show that in this case for each balanced distribution \i 
there exists a 3/4-correct rectangle of size 1/2° W at least. The proof is 
analogous to the proof of Theorem [3|.l. a, but this time we are guaranteed 
to find a large rectangle with small error, not a large 1-rectangle with 
small error. To adapt the proof one has to replace the uniform threshold 
values s', es' by the expected correct height (1— e)-E[h(x, y)] and incorrect 
height e • E[h(x, y)). 

Now we show the opposite direction, namely, given that for each bal- 
anced distribution fx we can find a 3/4-correct rectangle of size l/2 fc , then 
we can construct an approximate majority cover. 

First notice that in fact we can find a rectangle of size l/2 k and 
error at most 1/4 for all distributions on the inputs, since on unbalanced 
distributions we may simply take {0, l} n x {0, l} n as a rectangle with 
error 1/4 when choosing the appropriate label for that rectangle. 
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To construct the approximate majority cover we first consider the fact 
that min At max 1 , size(/j,,e, f,v) > l/2 k in a somewhat different light. Let 

parain, R) = e/(3/ji(R)) + err(R, (j,, v(R)) ■ 2 k . 

This parameter controls the quality of a rectangle. Yao's application of 



the minimax-principle to randomized algorithms (see | KN97 |) provides us 
with the following statement. 

Lemma 5. The following two statements are equivalent for all f . 

1. For all distributions [/, there is a rectangle Rn with parameter a. 

2. There is a probability distribution D on rectangles so that for all dis- 
tributions fj> on inputs the expected parameter of a rectangle is a. 

The latter could be named a "randomized rectangle" because it re- 
sembles a randomized algorithm. We know that for all distributions /x on 
{0, l} n x {0, l} n there is a rectangle i?„ with size l/2 k and error e = 1/4, 
hence paraQi, R^) < (l/4)/(3/2 fc ) + (1/4) • 2 k = (1/3) • 2 k . The ran- 
domized rectangle then offers a distribution on rectangles with expected 
parameter (1/3) • 2 k at most. Hence the expected rectangle size satisfies 
(l/4)/(3E[n(R)}) < (1/3) • 2 k E[n(R)] > l/2 k+2 . The expected error 
satisfies E[err(R, fi,v(R))] • 2 k < (l/3)2 k E[err(R, fi,v(R))] < 1/3. 

A randomized rectangle immediately gives us an approximate major- 
ity cover for /, though not of the desired size. To see this note that if we 
consider a distribution fi x>y concentrated on some fixed input x, y, then 
Prob D (v{R) ± f(x,y)\x,ye R) = E[err(R, fi x , y ,v(R))] < 1/3. 

From the direct application of the Yao-principle we do not get a bound 
on the number of rectangles with nonvanishing probabilities used in D. 
We use the following discretization. 

Lemma 6. Assume there is a randomized rectangle for f with expected 
size s = l/2°( fe ) and expected (constant) error e < 1/3. Then there is an 
approximate majority cover for f having size 2°^ ■ n. 

The lemma clearly implies APP(f) < 0(k)+logn. So let us prove the 
lemma. We independently pick t = c- (1/s) -n rectangles R%,...,Rt from 
the distribution D for some large enough constant c. Our claim is that 
this yields the desired approximate majority cover. Let w(x, y) denote 
the random variable counting the number of Ri with label v(R) ^ f(x, y) 
and x, y G Ri- Let h(x, y) denote the number of all Ri with x, y £ Ri- 

Consider the distribution fj, x ^ y concentrated on x, y. We know that the 
expected size of a rectangle picked from D is at least s. Since /j, x>y (R) 6 
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{0,1}, with probability at least s a chosen rectangle contains x,y. So 
E[h(x, y)} > s ■ c • (1/s) • n = cn. 

We know E[w(x, y)] < eE[h(x, y)], and want to bound Prob(w(x, y) > 
1.1 • eE[h(x,y)]), which is maximized if E[w(x,y)} is as large as possible, 
hence we assume E[w(x, y)\ = eE[h(x,y]. Using the Chernov bound 

Prob(w(x,y) > 1.1 • eE[h(x, y)]) 
= Prob(w(x,y) > 1.1 • E[w(x, y)]) 

< e -E[w(x,y)]/300 = e -eE[h(x,y)]/300 < g - ecn/ '300 _ 

Let c = 0(l/e) be large enough, so that the above probability is at most 
2-2n-i Then the probability that there exists one of the 2 2n inputs x,y 
with w(x,y) > 1.1 • eh(x,y) is smaller than 1. Consequently there exists 
a choice of i rectangles so that for all x,y: w(x,y) < 1.1 • eh(x,y). 

By Remark ^ we get an approximate majority cover. □ 
We now show an exponential gap between (even one-sided) bounded 
error uniform threshold covers and approximate majority covers. 

Proof of Theorem Consider the function BOTH : ({0, l} 2ri x 
{0, 1}) x ({0, l} 2n ) -► {0, 1} defined as follows: 

BOTH((x 1 ,x 2 ,a),(y 1 ,y 2 )) = {DISJ{x x , yi) A a) V (-^DISJ(x 2 , y 2 ) A-a). 

Hence depending on a the function either computes DISJ on the first 
pair of inputs or -^DISJ on the second pair. 

First we show that APP(BOTH) = O(logra). Note APP(^DISJ) = 
O(logn), since N{^DISJ) = O(logn) and APP(f) < N(f). Hence also 
APP(DISJ) = O(logn), since APP(f) = APP(->f) for all /. To find an 
approximate majority cover for BOTH we take the approximate majority 
cover for DISJ and intersect all its rectangles with the rectangle defined 
by a = 1. We also take the approximate majority cover for -^DISJ and 
intersect all its rectangles with the rectangle defined by a = 0. The union 
of these sets of rectangles is an approximate majority cover for BOTH. 
So APP(BOTH) = O(logn). 

Now we consider UT^^BOTH). We consider a distribution on in- 
puts in which a = 1 with probability 1. In this case with probability 
1, BOTH((x\, x 2 , a), (yi, y 2 )) = DISJ{x\,y\). Since there is a balanced 
distribution on inputs so that each 1-rectangle either has size l/2«(») or 
error at least e for some constant e > (see Fact |]), we can choose this 
distribution on the input positions Xi,x 2 , and fix x 2 ,y 2 arbitrarily. In this 
way we get a balanced distribution with the same properties for BOTH 
and hence UT^^BOTH) = Q(n) (using Theorem ||). 
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Now we consider UT^^BOTH). We may proceed as above, by 
fixing a = and considering the quality of 1-rectangles for —>(—>DISJ). 
So we get UT^ s (-^BOTH) = Q(n). □ 

Proof of Theorem [7|. It is easy to construct a majority cover for 
MAJ . The cover contains n 1-rectangles defined by Xi A yi plus [n/2] 

0- rectangles covering {0, l} n x {0, l} n . If we have MAJ{x,y) = 1, then 
at least [n/2] 1-rectangles x% A y% contain x,y, else at most [n/2] — 1 

1- rectangles contain x, y. 

For the lower bound we have to argue that there is a balanced dis- 
tribution for which all 1 — e-correct rectangles have size at most l/2^( n ). 
Let n' = 6k + 2 be the input length for some k satisfying k = 1 mod 2 
and k = 1 mod 3. 

First we fix 2k variables Xi,yi = 1. There are n = Ak + 2 remaining 
variables. MA J(x, y) = 1 <^=^ Ya=\ x -i /\ y -i > k + 1 under this fixing. 
We pretend in the following that there are n variables. 

Let us define the distribution. Let fi r be the uniform distribution 
on {(x,y) : \x\ = \y\ = n/2,\x n y\ = k}, and let fi a be the uniform 
distribution on {(x,y) : \x\ = \y\ = n/2, \x n y\ = k + 1}. Then let u be 
defined by fJ,(x,y) = (3/4) • fi r (x,y) + (1/4) • fj, a (x,y). The distribution is 
obviously balanced. 

We have to show that there are no large rectangles with small error, 
neither 1-rectangles nor O-rectangles. This is handled in the following way. 

Claim. If there is a 1-rectangle of size f2(s) and error 0(5) according 
to [i and MAJ, then there is a 0-rectangle of size f2(s) and error 0(5) 
according to u and MAJ, and vice versa. 

Proof of the claim. Assume there is a size s 1-rectangle R = A x B 
with a fraction of (1— S)s 1-inputs and 5s 0-inputs on u. Let ->A = {x : x £ 
^4}. We claim that -<AxB is an 0(5)-error size f2(s) 0-rectangle. Note that 
f(x,y) = <^=^ \xr\y\ = k and f(x, y) = 1 <^=^ \xf]y\ = k + 1 under ^, 
and that \y\ = 2k + l. Hence \xCiy\ = \y\ — \xHy\ = 2k + l — k — f(x,y), 
and so f(x,y) ^ f(x,y) with probability 1. So the rectangle -*A x B has 
entries with reversed function value compared to Ax B. The claim follows 
with (jl(x, y) = &{fi{x, y)). □ 

We are going to show that each 0-rectangle has size at most 2 _ ^( ra ) 
or has error e for some constant e. Then APP{M A J) = Q(n). 

We consider the following way to choose inputs according to a: First 
we choose a frame, namely a partition of {1, . . . , n} into sets of size k, 
z x ,z y of size (4/c + 2 — k — l)/2 = [3/c/2], and {i} of size 1, uniformly 
under all such partitions. Then x is chosen to contain all of z^ and with 



33 



probability 1/2 also {i}. x is filled up to a size 2k + 1 set by choosing 
uniformly elements of z x . y is chosen similarly, only with the filling up 
done from the set z y . Note that this produces the distribution \i. 

Now we fix Zk arbitrarily Let [i Zk denote the corresponding distribu- 
tion on inputs. Ignoring the variables in z^ the players choose sets with 
an intersection size in {0, 1}, i.e., they solve the (complement of the) 
disjointness problem on a specific distribution. 

We employ Fact |6| at this point. A technical problem for the applica- 
tion of this fact is that for fj, Zk subsets of size k + 1 are chosen from a size 
n — k = 3k + 2 universe. To overcome this we may fix arbitrary disjoint 
subsets s x , s y C {1, . . . , n} — z^ of size I = k/3 + 2/3 each. The variables 
in s x are set to 1 in x and the variables in s y are set to 1 in y. After 
fixing z^jSxjSy an input is chosen as follows. First z x and z y are chosen, 
under the condition that they include s x resp. s y , hence the remaining 
size of these is [~3/c/2] — \k/3~\ each. Then {i} is chosen and the frame 
is complete. Afterwards an input is chosen as before. Call the resulting 
distribution fJ, ZkjSxjSy . 

The number of remaining nonfixed variables when choosing according 

to 

^z k ,s x ,s y is n" — 3k + 2 — /. Disregarding the fixed I elements the size of 
x and of y is k + l — l = (2/3) k + 1/3 = n"/4. So disregarding the fixed 
inputs we have reached the distribution v of Fact ||. 

Under /i the weight of any input x, y can be expressed as the expec- 
tation over all possibilities to fix z^ and to fix s x , s y of the weight of the 
input under this fixing. Namely, 

fj,(x,y) = E Zk)SxySy [ii Zk)SxySy {x,y)}. 

We know from Fact ||] that for all Zk,s x ,s y : 

ti ZM {MAJ-\l) HR)> 1/5 • //,...,,!/.'! - 2-"W 



Hence also fj,(R n MAJ' 1 ^)) 

= E ZMy [v Zk , SxjSy (MAJ- 1 (l)nR)] 
> (1/5) • E Zk>Bw>Bv \p Zk>Bw>Bv (R)] - 2- n ^ 
= (l/5)-n(R)-2-^ n \ 

Hence any 0-rectangle for MAJ under \i either has size 2~ Qf - n \ or error 
1/5. Due to our previous claim within constant factors the same holds for 
1-rectangles. So the lower bound APP(MAJ) = Q(n) follows. □ 
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